Switching: understanding the class-reversed sampling in tail sample memorization
نویسندگان
چکیده
Long-tailed visual recognition poses significant challenges to traditional machine learning and emerging deep networks due its inherent class imbalance. Existing reweighting re-sampling methods, although effective, lack a fundamental theory while leaving the paradoxical effects of long tail unsolved, where network failing with head classes under-represented overfitted. In this paper, we investigate long-tailed from memorization-generalization point view, which not only unravels whys previous but also derives new principled solution. Specifically, first empirically identify regularity under distributions, finding that challenge is essentially trade-off between representation high-regularity generalization low-regularity classes. To memorize samples without seriously damaging samples, propose simple yet effective sampling strategy for ordinary mini-batch SGD optimization process, Switching, switches instance-balanced class-reversed once at small rate. By theoretical analysis, show upper bound on error proposed lower than conditionally. our experiments, method can reach feasible performance more efficiently current methods. Further experiments validate superiority Switching strategy, implying could be parsimoniously tackled in memorization stage rate over-exposure samples.
منابع مشابه
Asymptotic properties of the sample mean in adaptive sequential sampling with multiple selection criteria
We extend the method of adaptive two-stage sequential sampling toinclude designs where there is more than one criteria is used indeciding on the allocation of additional sampling effort. Thesecriteria, or conditions, can be a measure of the targetpopulation, or a measure of some related population. We developMurthy estimator for the design that is unbiased estimators fort...
متن کاملEstimating Variance of the Sample Mean in Two-phase Sampling with Unit Non-response Effect
In sample surveys, we always deal with two types of errors: Sampling error and non-sampling error. One of the most common non-sampling errors is nonresponse. This error happens when some sample units are not observed or viewed but they do not answer some of the questions. The complete prevention of this error is not possible, but it can be significantly reduced. The non-response causes bias and ...
متن کاملthe effect analysis of proficiency on language - switching in iranian efl learners writing
abstract in a protocol analysis of second language writing from 20 adult english as a foreign language (efl) iranian students, this research observed how language-switching (l-s), i.e., first language use in l2 writing, was affected by l2 proficiency. switching interactively between first (l1) and second (l2) languages has been recognized as one of the salient characteristics of l2 writing....
15 صفحه اولThe sample size required in importance sampling
The goal of importance sampling is to estimate the expected value of a given function with respect to a probability measure ν using a random sample of size n drawn from a different probability measure μ. If the two measures μ and ν are nearly singular with respect to each other, which is often the case in practice, the sample size required for accurate estimation is large. In this article it is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2022
ISSN: ['0885-6125', '1573-0565']
DOI: https://doi.org/10.1007/s10994-021-06087-3